INAM - A Scalable InfiniBand Network Analysis and Monitoring Tool

Authors

  • N. Dandapanthula
  • Hari Subramoni
  • Jérôme Vienne
  • Krishna Chaitanya Kandalla
  • Sayantan Sur
  • Dhabaleswar K. Panda
  • Ron Brightwell
Abstract

InfiniBand’s popularity in cluster and high performance computing can be attributed to its open standard and high performance. As InfiniBand (IB) clusters grow in size and scale, predicting the behavior of the IB network in terms of link usage and performance becomes an increasingly challenging task. Although the IB specification proposes a detailed subnet management infrastructure to handle various aspects of the network, there currently exists no open source tool that allows users to dynamically analyze and visualize the communication pattern and link usage in the IB network. In this context, we design and develop a scalable InfiniBand Network Analysis and Monitoring tool, INAM. INAM monitors IB clusters in real time and queries the subnet management entities in the IB network to gather the performance counters specified by the IB standard. We provide an easy-to-use web-based interface to visualize the performance counters and subnet management attributes of a cluster on demand. INAM is also capable of capturing the communication characteristics of a subset of links in the network, thereby allowing users to visualize and analyze the network communication characteristics of a job in a high performance computing environment. Our experimental results show that INAM is able to accurately visualize the link utilization as well as the communication pattern of target applications.
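As a rough illustration of how link utilization can be derived from the performance counters mentioned above, the sketch below takes two samples of a port's transmit-data counter and converts the difference into a fraction of the link's data rate. It is not INAM's implementation: the function name and parameters are hypothetical, and it assumes the counter follows the IB `PortXmitData` convention of counting data in 4-byte units and that the caller supplies the link's data rate.

```python
def link_utilization(xmit_data_start, xmit_data_end, interval_s, link_rate_gbps):
    """Estimate the fraction of a link's data rate used during a sampling interval.

    Hypothetical helper, not taken from INAM.
    xmit_data_start, xmit_data_end: two samples of a transmit-data counter,
        assumed to count in units of 4 bytes (as IB PortXmitData does).
    interval_s: seconds elapsed between the two samples.
    link_rate_gbps: data rate of the link in Gbit/s, supplied by the caller.
    """
    if interval_s <= 0:
        raise ValueError("interval_s must be positive")
    if link_rate_gbps <= 0:
        raise ValueError("link_rate_gbps must be positive")

    delta_words = xmit_data_end - xmit_data_start
    if delta_words < 0:
        # The counter was reset or wrapped between samples; the delta is unusable.
        raise ValueError("counter decreased between samples")

    bytes_sent = delta_words * 4                  # counter unit is 4 bytes
    bits_per_second = bytes_sent * 8 / interval_s
    return bits_per_second / (link_rate_gbps * 1e9)


if __name__ == "__main__":
    # Two made-up samples taken one second apart on a link with a 32 Gbit/s data rate.
    print(link_utilization(0, 250_000_000, 1.0, 32.0))
```

A real monitor would also need to handle counters that saturate rather than wrap, and to sample every port of interest at a consistent interval.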

Similar resources

INAM2: InfiniBand Network Analysis and Monitoring with MPI

Modern high-end computing is being driven by the tight integration of several hardware and software components. On the hardware front, there are the multi-/many-core architectures (including accelerators and co-processors) and high-end interconnects like InfiniBand that are continually pushing the envelope of raw performance. On the software side, there are several high performance implementati...

A Scalable InfiniBand Network Topology-Aware Performance Analysis Tool for MPI

Over the last decade, InfiniBand (IB) has become an increasingly popular interconnect for deploying modern supercomputing systems. As supercomputing systems grow in size and scale, the impact of IB network topology on the performance of high performance computing (HPC) applications also increases. Depending on the kind of network (FAT Tree, Tori, or Mesh), the number of network hops involved in ...

IBPM: An Open-Source-Based Framework for InfiniBand Performance Monitoring

In this paper, we present a tool for performance measurement of InfiniBand networks. Our tool analyzes the network and presents a comprehensible visualization of the performance and health of the network. InfiniBand network operators can use the tool to detect potential bottlenecks and optimize the overall performance of their network.

Using InfiniBand for a scalable compute infrastructure

Contents: Introduction; InfiniBand technology ...

Impact of Next-Generation I/O Architectures on the Design and Performance of Network Servers

The increasing demand for scalable and highly available network services has challenged computer architects to develop new I/O architectures for servers. The recently released InfiniBand industry standard, for instance, provides scalable bandwidth and protected I/O communication using the memory-mapped communication model and a channel-based switched fabric technology. Advanced device controllers...

Publication date: 2011